Discrimination of Audio Signals Using Time-Frequency Distributions

Author

  • Abdullah I. Al-Shoshan
Abstract

Humans can easily discriminate speech from music, even when the two are mixed. Advances in the analysis and synthesis of speech signals have given musical signal processing particular weight, and classical sound-analysis techniques are therefore applied to music signals. Music has a long and distinguished history: it dates back to ancient Greece and has developed over the centuries in both its instruments and its melodies. Audio signal classification is a fundamental step in handling the rapid growth in audio data volume [1], [2], [3]. There are many kinds of music, such as Classical, Rock, Pop, Disco, Jazz, Country, Latin, Electronic, and Arabic [4], [5], [6], [7], [8]. Audio signals change continuously and non-deterministically with time [12]. Consequently, they are usually characterized by time averages, from which their relative amplitude and frequency content can be easily specified. For example, speech and music typically have strong low-frequency energy and progressively weaker high-frequency content [9], [10], [11], [12]. The maximum frequency, fmax, of an audio signal depends on the kind of signal: fmax is 22 kHz for CD-quality recording, 11 kHz for FM stereo broadcasting, 6 kHz for stereo or multi-loudspeaker recording, 5 kHz for mono-loudspeaker recording, and 4 kHz for traditional telephone-quality transmission. A generalized frequency spectrum for audio signals is shown in Figure 1. Audio signals can be classified into the following classes:

1. Speech from a single talker over a specific time period.
2. Pure music without any speech component.
3. A mixture of single-talker speech and background music.
4. Songs: a mixture of music and a singer's voice.
5. Singing without music.
6. Abnormal music: music that uses acclaim cadence, single-word cadence, human whistling, opposite reverberation, or any other non-musical sound inserted as a basic tone of the melody. These cadences cannot be generated by ordinary musical instruments other than the modern organ, and they are produced mainly with the help of computers.
7. Speech from two or more speakers talking simultaneously over a specific time period. Cepstrum analysis is a good basis for separating the sounds of two talkers speaking simultaneously.
8. Non-speech, non-music signals, such as car, motor, and fan sounds.
9. Complex sound mixtures, such as multiple speakers or singers with multiple music sources.
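The abstract names cepstrum analysis as a tool for simultaneous talkers but gives no detail. As a rough illustration only, and not the author's separation algorithm, the sketch below computes the real cepstrum of one frame and reads a pitch estimate from its strongest peak, which is the usual building block for telling voices with different pitches apart. The function names, the synthetic test tone, the 8 kHz sampling rate (chosen to match the 4 kHz telephone-quality fmax), and the 70 to 400 Hz search range are all assumptions made for this example.

```python
import numpy as np

def real_cepstrum(frame):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    spectrum = np.fft.fft(frame)
    log_mag = np.log(np.abs(spectrum) + 1e-12)  # small offset avoids log(0)
    return np.fft.ifft(log_mag).real

def estimate_pitch_hz(frame, fs, fmin=70.0, fmax=400.0):
    """Return fs / q, where q is the strongest cepstral peak (in samples)
    inside the quefrency range that corresponds to fmin..fmax (assumed range)."""
    ceps = real_cepstrum(frame * np.hamming(len(frame)))
    q_lo = int(fs / fmax)  # shortest pitch period considered
    q_hi = int(fs / fmin)  # longest pitch period considered
    peak = q_lo + int(np.argmax(ceps[q_lo:q_hi]))
    return fs / peak

def harmonic_tone(f0, fs, n, harmonics=30):
    """Hypothetical stand-in for a voiced frame: harmonics of f0 with 1/k amplitudes."""
    t = np.arange(n) / fs
    return sum(np.sin(2 * np.pi * k * f0 * t) / k for k in range(1, harmonics + 1))

fs = 8000                                 # assumed sampling rate in Hz
frame = harmonic_tone(125.0, fs, 1024)    # synthetic frame with a 125 Hz fundamental
pitch = estimate_pitch_hz(frame, fs)
print(f"estimated pitch: {pitch:.1f} Hz")
```

With two simultaneous talkers, one would expect two such peaks at different quefrencies; picking and tracking them over successive frames is left out of this sketch.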

Similar resources

Discrimination of Power Quality Distorted Signals Based on Time-frequency Analysis and Probabilistic Neural Network

Recognition and classification of Power Quality Distorted Signals (PQDSs) in power systems is an essential duty. One of the noteworthy issues in Power Quality Analysis (PQA) is identification of distorted signals using an efficient scheme. This paper recommends a Time–Frequency Analysis (TFA), for extracting features, so-called "hybrid approach", using incorporation of Multi Resolution Analysis...

Pathologies cardiac discrimination using the Fast Fourier Transform (FFT), the short time Fourier transform (STFT) and the Wigner distribution (WD)

This paper is concerned with a synthesis study of the fast Fourier transform (FFT), the short time Fourier transform (STFT), and the Wigner distribution (WD) in analysing the phonocardiogram signal (PCG), or heart sounds. The FFT can provide a basic understanding of the frequency content of the heart sounds. The STFT is obtained by calculating the Fourier tran...

Complex feature analysis of center of pressure signal for age-related subject classification

Purpose: The aim of this study was to characterize prolonged standing and its effect on postural control in elderly individuals in comparison to adults. Materials and Methods: The behavior of elderly individuals during standing, and how demanding such a task is for them, is still unknown. We recorded the center of pressure (COP) position of 12 elderly and 15 young participants while they were standin...

Audio Signal Discrimination Using Evolutionary Spectrum

In this paper, a joint-distribution algorithm, mainly, the evolutionary spectrum (ES) for audio signal discrimination is proposed and discussed. The purpose of audio signal discrimination is to build two different libraries: speech library and music library, from a stream of sounds. In general, the classification algorithms can be divided into three approaches: time-domain, frequency-domain, an...

Psychoacoustics and speech perception in individuals with auditory neuropathy and in normal individuals

Background: The main result of hearing impairment is a reduction in speech perception. Patients with auditory neuropathy can hear, but they cannot understand. Their difficulties have been traced to timing-related deficits, revealing the importance of the neural encoding of timing cues for understanding speech. Objective: In the present study psychoacoustic perception (minimal noticeable differen...

Publication date: 2015